135 research outputs found
Fast Algorithms for Constructing Maximum Entropy Summary Trees
Karloff? and Shirley recently proposed summary trees as a new way to
visualize large rooted trees (Eurovis 2013) and gave algorithms for generating
a maximum-entropy k-node summary tree of an input n-node rooted tree. However,
the algorithm generating optimal summary trees was only pseudo-polynomial (and
worked only for integral weights); the authors left open existence of a
olynomial-time algorithm. In addition, the authors provided an additive
approximation algorithm and a greedy heuristic, both working on real weights.
This paper shows how to construct maximum entropy k-node summary trees in time
O(k^2 n + n log n) for real weights (indeed, as small as the time bound for the
greedy heuristic given previously); how to speed up the approximation algorithm
so that it runs in time O(n + (k^4/eps?) log(k/eps?)), and how to speed up the
greedy algorithm so as to run in time O(kn + n log n). Altogether, these
results make summary trees a much more practical tool than before.Comment: 17 pages, 4 figures. Extended version of paper appearing in ICALP
201
Visualizing Multivariate Hierarchic Data Using Enhanced Radial Space-Filling Layout
Currently, visualization tools for large ontologies (e.g., pathway and gene ontologies) result in a very flat wide tree that is difficult to fit on a single display. This paper develops the concept of using an enhanced radial space-filling (ERSF) layout to show biological ontologies efficiently. The ERSF technique represents ontology terms as circular regions in 3D. Orbital connections in a third dimension correspond to non-tree edges in the ontology that exist when an ontology term belongs to multiple categories. Biologists can use the ERSF layout to identify highly activated pathway or gene ontology categories by mapping experimental statistics such as coefficient of variation and overrepresentation values onto the visualization. This paper illustrates the use of the ERSF layout to explore pathway and gene ontologies using a gene expression dataset from E. coli
Visual Analysis of Multi-Joint Kinematic Data
Abstract Kinematics is the analysis of motions without regarding forces or inertial effect
The generalized Robinson-Foulds metric
The Robinson-Foulds (RF) metric is arguably the most widely used measure of
phylogenetic tree similarity, despite its well-known shortcomings: For example,
moving a single taxon in a tree can result in a tree that has maximum distance
to the original one; but the two trees are identical if we remove the single
taxon. To this end, we propose a natural extension of the RF metric that does
not simply count identical clades but instead, also takes similar clades into
consideration. In contrast to previous approaches, our model requires the
matching between clades to respect the structure of the two trees, a property
that the classical RF metric exhibits, too. We show that computing this
generalized RF metric is, unfortunately, NP-hard. We then present a simple
Integer Linear Program for its computation, and evaluate it by an
all-against-all comparison of 100 trees from a benchmark data set. We find that
matchings that respect the tree structure differ significantly from those that
do not, underlining the importance of this natural condition.Comment: Peer-reviewed and presented as part of the 13th Workshop on
Algorithms in Bioinformatics (WABI2013
Information Visualisation for Project Management: Case Study of Bath Formula Student Project
This paper contributes to a better understanding and design of dashboards for monitoring of engineering projects based on the projects’ digital footprint and user-centered design approach. The paper presents an explicit insight-based framework for the evaluation of dashboard visualisations and compares the performance of two groups of student engineering project managers against the framework: a group with the dashboard visualisations and a group without the dashboard. The results of our exploratory study demonstrate that student project managers who used the dashboard generated more useful information and exhibited more complex reasoning on the project progress, thus informing knowledge of the provision of information to engineers in support of their project understanding
Recommended from our members
Designing an Exploratory Visual Interface to the Results of Citizen Surveys
Surveys are used by public authorities to monitor the quality and reach of public services and provide information needed to help improve them. The results of such surveys tend to be used in internal reports, with highly-aggregated summaries being released to the public. Even where data are released, many citizens do not have the capability to explore and interpret them. This o ffers limited scope for citizens to explore the results and use them to help hold service providers to account - objectives that are increasingly important in public service provision. We work closely with an English local authority to develop an innovative interactive interface to a citizen survey to demonstrate what can be achieved by applying a visual approach to the exploration of such data. In so doing we (a) make a case for web-based interactive visualisation to make this kind of information accessible both internally to those working in local government and externally to citizens in a way that is not achieved through a regular Open Data release or existing applications; (b) use techniques from both cartography and information visualization to inform the design of fluid visual interactions that enable diverse users - from the casual citizen browser to those interested in more in-depth analysis - to view, compare and interpret the survey outputs from a wide variety of perspectives; and (c) document experiences and reactions to the provision of information in this form, with log analysis playing a role in this exercise. Our reflections on our successes and otherwise will inform future exploratory interface design to help citizens access information and hold public service providers to account
An end-user pipeline for scrapping and visualizing semi-structured data over the Web
The Web is a vast source of semi-structured data sets that are made readily available to support the construction of new knowledge. Information visualization techniques have been demonstrated a suitable alternative for allowing users to analyze and understand a large amount of data. However, the steps required for visualizing semi-structured data obtained from the Web is not straightforward, and it requires proper treatment before information visualization techniques could be applied.
In this work, we present a visualization pipeline for describing the fundamental operations required for visualizing semi-structured data over the Web. For that, we employ Web Scrapping and Web Augmentation techniques for supporting interactive visualizations and solving tasks without changing the context of use of the data. Our approach is duly supported by a framework including scrapping, augmenting and visualization tools and it has been applied to different kinds of websites to demonstrate its validity and feasibility. Our ultimate goal is to expand the limits of our technology for improving the user interaction with websites and creating new experiences for better understanding large data sets
A Visual Analytics Framework Case Study: Understanding Colombia’s National Administrative Department of Statistics Datasets
In a world filled with data, it is expected for a nation to take decisions informed by data. However, countries need to first collect and publish such data in a way meaningful for both citizens and policy makers. A good thematic classification could be instrumental in helping users to navigate and find the right resources on a rich data repository, such as the one collected by the DANE (Departamento Administrativo Nacional de EstadĂstica, i.e. the Colombia’s National Administrative Department of Statistics). The Visual Analytics Framework is a methodology for conducting visual analysis developed by T. Munzner et al.1 that could help with this task. This paper presents a case study applying such framework conducted to help the DANE to better visualize their data repository, and also to understand it better by using another classification extracted from its metadata. It describes the three main analysis tasks identified and the proposed solutions. Usability testing results during the process helped to correct the visualizations and make them adapted to decision-making. Finally, we explained the collection of insights generated from them
Generating Dashboards Using Fine-Grained Components: A Case Study for a PhD Programme
Developing dashboards is a complex domain, especially when several
stakeholders are involved; while some users could demand certain indicators,
other users could demand specific visualizations or design features.
Creating individual dashboards for each potential need would consume several
resources and time, being an unfeasible approach. Also, user requirements must
be thoroughly analyzed to understand their goals regarding the data to be
explored, and other characteristics that could affect their user experience. All
these necessities ask for a paradigm to foster reusability not only at development
level but also at knowledge level. Some methodologies, like the Software
Product Line paradigm, leverage domain knowledge and apply it to create a
series of assets that can be composed, parameterized, or combined to obtain
fully functional systems. This work presents an application of the SPL paradigm
to the domain of information dashboards, with the goal of reducing their
development time and increasing their effectiveness and user experience. Different
dashboard configurations have been suggested to test the proposed
approach in the context of the Education in the Knowledge Society PhD programme
of the University of Salamanca
- …